scikit-learn
nltk
PyPDF2
pandas
numpy
